Implicit Measurement of Intelligibility of Male and Female Voice Text-to-speech (tts) Synthesis in Noise Using a Phoneme Detection Task

نویسندگان

  • Nicole Lees
  • Catherine Stevens
  • Julie Vonwiller
چکیده

ABSTRACT: Given the increasing application of TTS synthesis in commercial and clinical settings, there is a need to develop methods of evaluation from the user’s perspective. An experiment is reported that compares the effect of two factors, voice gender and signal quality, on the intelligibility of three TTS systems from the user’s point of view. It was hypothesised that male voiced TTS would be more intelligible than female voiced TTS and that low quality signals would reduce intelligibility. Accuracy scores were obtained from a phoneme detection task using Semantically Unpredictable Sentences (SUS). The results indicate that an interaction between voice gender and signal quality is dependent on the TTS system. The results suggest that intelligibility, from the user’s point of view, is modulated by several factors. Consequently there is a need to evaluate TTS systems in the context of commercial requirements and specific applications.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Experimental tools to evaluate intelligibility of text-to-speech (TTS) synthesis: effects of voice gender and signal quality

Two experiments are reported that constitute new methods for evaluation of text-to-speech (TTS) synthesis from the user’s perspective. Experiment 1, using sentence stimuli, and Experiment 2, using discrete word stimuli, investigate the effect of voice gender and signal quality on the intelligibility of three TTS synthesis systems from the user’s point of view. Accuracy scores and reaction time ...

متن کامل

On-line experimental methods to evaluate text-to-speech (TTS) synthesis: effects of voice gender and signal quality on intelligibility, naturalness and preference

Three experiments are reported that use new experimental methods for the evaluation of text-to-speech (TTS) synthesis from the user’s perspective. Experiment 1, using sentence stimuli, and Experiment 2, using discrete ‘‘call centre’’ word stimuli, investigated the effect of voice gender and signal quality on the intelligibility of three concatenative TTS synthesis systems. Accuracy and search t...

متن کامل

Speech Synthesis System for Telugu Language

A system which takes input as a sequence of words and converts them to speech. Vowels and consonants are most important in Telugu language. The voices are sampled from real recorded speech. The speech synthesis is handheld by computers and mobile phones. To build a natural sounding speech synthesis system, it is essential that text processing component produce an appropriate sequence of phonemi...

متن کامل

Text-To-Speech Intelligibility Across Speech Rates

A web-based listening test measured intelligibility across speech rate of 8 TTS systems and of a linearly timecompressed human speech reference voice. The synthesis systems included 2 independent representatives of each of the following 4 synthesis methods: formant, diphone concatenation, unit selection concatenation, and HMM. For each TTS system, a female and a male American English voice were...

متن کامل

Automatic word stress marking and syllabification for Catalan TTS

Stress and syllabification are essential attributes for several components in text-to speech (TTS) systems. They are responsible for improving grapheme-to-phoneme conversion rules and for enhancing the synthetic intelligibility, since stress and syllable are key units in prosody prediction. This paper presents three linguistically rule-based automatic algorithms for Catalan text-to-speech conve...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002